A self-consistent approach to measure preferential attachment in networks and its
application to an inherent structure network

Preferential attachment is one possible way to obtain a scale-free network. We develop a
self-consistent method to determine whether preferential attachment occurs during the growth of a
network, and to extract the preferential attachment rule using time-dependent data. Model net- 
works are grown with known preferential attachment rules to test the method, which is seen to be 
robust. The method is then applied to a scale-free inherent structure network, which represents the 
connections between minima via transition states on a potential energy landscape. Even though 
this network is static, we can examine the growth of the network as a function of a threshold energy 
(rather than time), where only those transition states with energies lower than the threshold energy 
contribute to the network. For these networks we are able to detect the presence of preferential 
attachment, and this helps to explain the ubiquity of funnels on energy landscapes. However, the 
scale-free degree distribution shows some differences from that of a model network grown using the 
obtained preferential attachment rules, implying that other factors are also important in the growth 
process. 
I. INTRODUCTION

Scale-free networks are ubiquitous in nature [1, 2],
technology [3, 4] and society [5, 6]. It is well known
that a growth mechanism involving preferential attachment
can lead to scale-free networks [7]. In the original
Barabási-Albert (BA) model [7], a new node is added at
each time step together with $m$ new edges. These new
edges are more likely to attach to old nodes with high
degree; thus the higher the degree of a node, the faster
its degree increases, leading to the power-law degree
distribution that is characteristic of scale-free networks.

For linear preferential attachment, nodes gain edges
with a probability $\Pi$ that is proportional to their degree
$k$. For a more flexible power-law form, $\Pi \propto k^\alpha$, it has
been shown that a scale-free network is only produced by
the BA model if the exponent $\alpha = 1$ [8]. For $\alpha < 1$, the
degree distribution follows a stretched exponential, and
for $\alpha > 1$ a single site connects to all the nodes. However,
in more complex models, the resulting networks can be
scale-free even if $\alpha \neq 1$ [9-14].
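The growth rule described above can be illustrated with a minimal Python sketch. It is not the model used later in the paper: the two-node seed graph, the function name `grow_network` and the `seed` argument are assumptions made only for illustration, and each new node simply attaches $m$ edges to distinct existing nodes chosen with probability proportional to $k^\alpha$.

```python
import random

def grow_network(n_nodes, alpha, m=1, seed=0):
    """Grow a network in which each new node attaches m edges to distinct
    existing nodes, chosen with probability proportional to degree**alpha.
    The two-node seed graph is an illustrative assumption."""
    rng = random.Random(seed)
    degree = [1, 1]            # two initial nodes joined by one edge
    edges = [(0, 1)]
    for new in range(2, n_nodes):
        weights = [k ** alpha for k in degree]
        targets = set()
        # draw until m distinct targets are found (no multiple edges)
        while len(targets) < min(m, len(degree)):
            targets.add(rng.choices(range(len(degree)), weights=weights)[0])
        degree.append(0)
        for t in targets:
            edges.append((t, new))
            degree[t] += 1
            degree[new] += 1
    return degree, edges

degree, edges = grow_network(500, alpha=1.0)
print(max(degree), sum(degree) / len(degree))
```

With `alpha=1.0` this reduces to linear preferential attachment; changing `alpha` changes how strongly high-degree nodes are favoured.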

Previously, we have analysed the properties of inher- 
ent structure (IS) networks [15-17], and here we want to 
test whether the preferential attachment model can help 
to explain the scale-free character of these networks. IS 
networks represent potential energy landscapes by tak- 
ing local minima in the potential energy, or inherent 
structures, to be nodes in the network, and edges link 
those minima that are directly connected by a transition 
state. This network provides a dynamically relevant rep- 
resentation of the landscape, since the low-temperature 
dynamics of a system can be considered in terms of a 
hopping between basins of attraction surrounding the 
minima via transition states [18]. IS networks for small 
Lennard-Jones clusters have been shown to be scale-free
[15]. Furthermore, indirect evidence that this topology is 
more general has come from the properties of the basins 
of attraction surrounding minima for supercooled liquids 
[19], and from the topology of more coarse-grained de- 
scriptions of the energy landscapes of proteins [20, 21]. 

Understanding the origin of the topology of the IS net- 
works is a particular challenge because, unlike many net- 
works, they are static. The topology is determined just 
by the interparticle potential and the size of the system. 
Therefore, a preferential attachment approach might, at 
first sight, seem totally inappropriate to these networks 
because they do not grow in time. However, here we wish 
to explore whether the IS networks can be understood 
in terms of a quasi-growth process. In particular, we 
will examine how the network evolves as a function of a 
threshold energy, where only those transition states that 
lie below that energy give rise to edges in the network. In 
this quasi-growth process the low-energy minima corre- 
spond to the 'older' nodes in the network. We have pre- 
viously shown that there is a strong correlation between 
the energy of a minimum and its degree with the 'older' 
lower-energy minima having higher degree [15], thus sug- 
gesting that a preferential attachment model might be 
fruitful. 

So far, preferential attachment has only been tested 
empirically in relatively few cases [9-14, 22, 23]. This is 
partly due to the lack of time-resolved data for growing 
networks, but a reliable method for determining the form 
of the preferential attachment is also lacking. The main
problem is that the probability of a node with given degree
gaining an edge, $\Pi(k,t) = k^\alpha / \sum_i k_i^\alpha$, depends on
the size of the network through the denominator of this
expression.

A method that has been used previously is to consider 
two snapshots of the network a certain time apart [10-13]. 
The relationship between the number of edges gained by 
a node in this time and its degree in the initial network 
can be determined. The degree of each node, and hence
$\sum_i k_i^\alpha$, is assumed to remain constant over this time
period. The consequences of this approximation are not
clear. Furthermore, the results will depend on the size of 
the time period used. Actor, scientific collaboration and 
protein interaction networks have been studied using this 
method, finding exponents between 0.75 and 1.1. 

Newman studied time-resolved data for scientific col- 
laboration networks [9], comparing the probability that 
a node with given degree gains an edge (as measured 
from the data) with that expected if preferential attachment
was not occurring. This considers each step in sequence,
but effectively assumes that the time-dependent
constant, $\sum_i k_i^\alpha$, is proportional to the size of the
network. In the case that the average degree of the network
remains constant and $\alpha = 1$, this assumption is true.
However, the average degree of many networks increases 
[10, 24-26], and exponents different to one have been 
measured [10, 11, 13]. 

The above methods proved inadequate for analysing
the quasi-growth of the IS networks. Therefore, to overcome
the deficiencies of the above approaches, in Section II
we introduce a self-consistent, iterative method
to determine $\alpha$ from time-dependent data describing the
growth of a network. An initial trial value of the exponent
$\alpha$ is assumed, and hence the constant of proportionality
$\sum_i k_i^\alpha$ can be estimated, which in turn allows
an improved estimate of the exponent to be obtained.
This process is repeated until convergence is achieved. In
Section III, the new self-consistent method is applied to
model networks, for which the exponent for preferential
attachment is known, in order to determine the accuracy
of the method. As the growth process is stochastic, some
uncertainty in the value of $\alpha$ is inevitable. If each node
gained precisely $k^\alpha / \sum_i k_i^\alpha$ edges at each time step, the
method would be exact. Finally, in Section IV, we apply
this method to the IS network for the 13-atom Lennard-Jones
cluster, LJ13.



II. METHOD 

When a new node is added to the network at time
$t$, we assume that the degrees of the existing nodes to
which the new node connects are chosen with probability
$\Pi(k,t) \propto n_k(k,t) f(k)$, where $n_k(k,t)$ is the number of
nodes with degree $k$ at time $t$, and we assume $f(k)$ to be
time independent and to have the preferential attachment
form $f(k) = k^\alpha$. In one time step, the expected number
of edges gained by all nodes with degree $k$ is then

$$\Delta k(k,t) = m(t)\,\Pi(k,t) = \frac{m(t)\, n_k(k,t)\, f(k)}{c_t(t)}, \qquad (1)$$

where $m(t)$ is the number of new edges added and
$c_t(t) = \sum_{k'} n_k(k',t) f(k')$.
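As a small worked illustration of Eq. 1, the following Python sketch computes the expected number of edges gained by each degree class for a given list of node degrees. The function name `expected_gain` and the toy degree list are assumptions for illustration only.

```python
from collections import Counter

def expected_gain(degrees, m, alpha):
    """Expected edges gained per degree class in one step (Eq. 1),
    assuming f(k) = k**alpha."""
    n_k = Counter(degrees)                                # n_k(k, t)
    c_t = sum(n * k ** alpha for k, n in n_k.items())     # normalisation c_t(t)
    return {k: m * n * k ** alpha / c_t for k, n in n_k.items()}

# Toy network: two nodes of degree 1, one of degree 2, one of degree 4.
gain = expected_gain([1, 1, 2, 4], m=1, alpha=1.0)
print(gain)
```

By construction the expected gains summed over all degree classes equal $m$, which is a quick consistency check on any implementation.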

The basic idea for determining $f(k)$ is based on inverting
the above expression. $m$, $\Delta k$ and $n_k$ can be simply
measured at each time step, and $c_t$ can be estimated by
using a trial exponent. A fit to $f(k) = k^\alpha$ gives a new
exponent, which is then used to improve the estimate
of $c_t$. This process is repeated until self-consistency is
achieved, i.e. the exponent converges.
More specifically, rearranging Eq. 1 gives

$$f(k) = \frac{\Delta k(k,t)\, c_t(t)}{m(t)\, n_k(k,t)}. \qquad (2)$$



As $f(k)$ is assumed to be time independent, this could
then be averaged over all appropriate time steps. However,
Eq. 2 diverges if $n_k(k,t) = 0$. Therefore, to avoid
this situation we sum both sides of Eq. 1 over $t$, effectively
comparing the number of edges gained by all nodes
with degree $k$ at all times, $\sum_t \Delta k(k,t)$, with the expected
number, $\sum_t \left( m(t)\, n_k(k,t)\, f(k) / c_t(t) \right)$. This then gives

$$f(k) = \frac{\sum_t \Delta k(k,t)}{\sum_t \left( m(t)\, n_k(k,t) / c_t(t) \right)}. \qquad (3)$$



If one wished, the assumption that $f(k)$ is independent
of time could be checked by calculating $\alpha$ for different
time ranges.
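The iteration described in this section can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the event format (a pair of dictionaries per time step), the function names, the unweighted fit with a free intercept, and the convergence tolerance are all hypothetical choices. The binning and weighting discussed in the text are omitted here.

```python
import math
from collections import defaultdict

def estimate_f(events, alpha):
    """Eq. 3, with c_t evaluated using the trial exponent alpha.
    events: list of (n_k, gained); n_k maps degree -> number of nodes just
    before the step, gained maps degree -> edges gained in the step."""
    num, den = defaultdict(float), defaultdict(float)
    for n_k, gained in events:
        m = sum(gained.values())
        c_t = sum(n * k ** alpha for k, n in n_k.items())
        for k, n in n_k.items():
            den[k] += m * n / c_t
        for k, dk in gained.items():
            num[k] += dk
    return {k: num[k] / den[k] for k in den}

def fit_slope(f):
    """Unweighted least-squares slope of log f against log k (f > 0 only)."""
    pts = [(math.log(k), math.log(v)) for k, v in f.items() if v > 0]
    xm = sum(x for x, _ in pts) / len(pts)
    ym = sum(y for _, y in pts) / len(pts)
    return (sum((x - xm) * (y - ym) for x, y in pts)
            / sum((x - xm) ** 2 for x, _ in pts))

def self_consistent_alpha(events, trial=1.0, tol=1e-10, max_iter=200):
    """Repeat: trial exponent -> c_t -> f(k) -> fitted exponent."""
    alpha = trial
    for _ in range(max_iter):
        new_alpha = fit_slope(estimate_f(events, alpha))
        if abs(new_alpha - alpha) < tol:
            return new_alpha
        alpha = new_alpha
    return alpha
```

If every degree class gained exactly its expected number of edges, the true exponent would be a fixed point of this iteration, which is a convenient check before applying such a routine to stochastic data.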

Finally, $\alpha$ is obtained from $f(k)$ using a linear least
squares fit for $\log f = \alpha \log k$. However, the statistics for
high-degree nodes are poor, as there are fewer of them and
they only occur at later time steps. There are high-degree
nodes that are not chosen to get any new edges, giving
$f(k) = 0$, which cannot be taken into account in the log-log
fit. An approach that has been used previously is to
plot the cumulative distribution, giving an exponent of
$\alpha + 1$ [10, 11, 13]. However, for the examples we consider,
we find that logarithmic binning over degree gives better
results. The contribution of each degree to the bin is
weighted by the number of time steps at which there are
nodes with that degree, i.e. the number of terms in the
sums of Eq. 3. This weighting was also used in the least
squares fit.
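One possible reading of the binning and weighted fit is sketched below in Python. The text does not specify how bin centres or bin values are defined, so the choices here (weighted arithmetic means of $k$ and $f$ within each logarithmic bin, and a weighted least-squares slope with a free intercept) are assumptions, as are the function names.

```python
import math

def log_bin(f, weight, n_bins):
    """Weighted logarithmic binning of f(k) over degree k.
    f:      dict degree -> estimated f(k); zeros are allowed
    weight: dict degree -> number of time steps with nodes of that degree
    Returns (mean k, mean f, total weight) for bins with a positive mean f."""
    ks = sorted(f)
    lo, hi = math.log(ks[0]), math.log(ks[-1])
    width = (hi - lo) / n_bins if hi > lo else 1.0
    acc = [[0.0, 0.0, 0.0] for _ in range(n_bins)]   # sum w, sum w*k, sum w*f
    for k in ks:
        i = min(int((math.log(k) - lo) / width), n_bins - 1)
        w = weight[k]
        acc[i][0] += w
        acc[i][1] += w * k
        acc[i][2] += w * f[k]
    return [(wk / w, wf / w, w) for w, wk, wf in acc if w > 0 and wf > 0]

def weighted_slope(points):
    """Weighted least-squares slope of log f against log k."""
    total = sum(w for _, _, w in points)
    xm = sum(w * math.log(k) for k, _, w in points) / total
    ym = sum(w * math.log(v) for _, v, w in points) / total
    sxy = sum(w * (math.log(k) - xm) * (math.log(v) - ym) for k, v, w in points)
    sxx = sum(w * (math.log(k) - xm) ** 2 for k, _, w in points)
    return sxy / sxx
```

Because zero values of $f(k)$ are averaged with their neighbours before the logarithm is taken, degrees that never gained an edge still contribute to the fit, which is the stated motivation for binning.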

Many networks also gain new edges between two ex- 
isting nodes. These have been termed 'internal edges', 
whilst new edges connecting to a new node are 'external 
edges' [27]. When internal edges are added, the average 
degree of the network increases with time and the net- 
work is said to be accelerating [24-26]. This increases 
the global connectivity of the network, and is important 
in regulatory networks, such as biological networks [25]. 
The current method for measuring preferential attachment
can equally well be applied to internal links, where
the product of the degrees of the pair of existing nodes is
chosen with probability $\Pi(k_i k_j,t) \propto n_k(k_i k_j,t) f(k_i k_j)$,
and $f(k_i k_j) = (k_i k_j)^\alpha$. Thus, preferential attachment
can be studied separately for internal and external links.



III. MODEL NETWORKS 

To test the method described in the previous section,
we first apply it to model networks that have been grown
using preferential attachment to see if the method can
correctly recover the exponent used. As we are developing
this method in order to apply it to an IS network,
we choose the model networks to have some of the same basic
properties as the IS network that we consider in Section
IV, e.g. the number of nodes (1509) and edges (20687).
Furthermore, at each time step in the growth of the networks,
one of three possible processes occurs, namely the
addition of an external edge, the addition of an internal
edge, or the addition of two new nodes with an edge
between them. The relative frequencies of the three events,
which have a strong effect on network topology [24, 27],
were taken to be the same as for the quasi-growth of the
IS network. One edge is added at each time step, giving
$m = 1$ for all times, also as for the IS network. New links
between two new nodes cannot preferentially attach, but
they are necessary to reproduce the correct event
frequencies, and to avoid complications from initial conditions.

FIG. 1: (Colour online) Degree distributions averaged over
1000 networks, with 1509 nodes, for versions of the model
excluding and allowing ME/SCs.

A further complication associated with the introduction
of internal links is that this can lead to multiple
edges (MEs) and self-connections (SCs), which can have
a significant effect on network topology [28]. In the case
of external links, ME/SCs are only relevant if more than
one edge is added with the new node. ME/SCs are not
present in the IS network, as is true of many other networks,
and so they are excluded from the model networks.
To exclude ME/SCs when adding an internal link, two
nodes are chosen from the set of pairs of nodes that would
not give an ME/SC. This set of pairs is also used to
determine $c_t$ for these time steps.
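The restriction to pairs that would not give an ME/SC can be sketched in Python as below. The representation of the network (a degree list plus an edge list) and the function names are assumptions for illustration; the normalisation is taken as the sum of $(k_i k_j)^\alpha$ over the allowed pairs.

```python
from itertools import combinations

def allowed_pairs(n_nodes, edges):
    """Unordered pairs of distinct nodes that are not already connected,
    i.e. pairs whose linking would create neither a multiple edge
    nor a self-connection."""
    existing = {frozenset(e) for e in edges}
    return [(i, j) for i, j in combinations(range(n_nodes), 2)
            if frozenset((i, j)) not in existing]

def internal_ct(degree, edges, alpha):
    """Normalisation for an internal-link step: the sum of (k_i k_j)**alpha
    over the allowed pairs only."""
    return sum((degree[i] * degree[j]) ** alpha
               for i, j in allowed_pairs(len(degree), edges))

# Path graph 0-1-2: the only pair that can still be linked is (0, 2).
print(allowed_pairs(3, [(0, 1), (1, 2)]), internal_ct([1, 2, 1], [(0, 1), (1, 2)], 1.0))
```

Enumerating all pairs costs $O(N^2)$ per step, which is acceptable for a sketch but would need a more careful implementation for large networks.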

In the final network, there will be fewer edges between
high-degree nodes compared to a network where ME/SCs
are allowed. When a new edge is added, high-degree
nodes are likely to be connected already and so cannot be
connected if ME/SCs are excluded, but would be likely to
be chosen if ME/SCs are allowed. In terms of the degree
distribution, the maximum degree will be lower. The
degree distribution obtained from this model is shown
in Fig. 1. If ME/SCs are excluded, there is a cut-off
at high degree which is not present when ME/SCs are
allowed. The cut-off is particularly extreme in this case
as the network is very dense with high average degree
($\langle k \rangle = 27.4$).

TABLE I: Model networks were grown using a given input
exponent, and the output exponent was measured for external
and internal edges. Results are for one realisation of the model
network for each input exponent.

Input exponent    Output exponent
                  external edges    internal edges
0.5               0.4817            0.4807
0.8               0.7823            0.8053
1.0               0.9882            0.9955
1.2               1.2038            1.2044
1.5               1.6245            1.5594

The self-consistent method described is very successful
in determining the preferential attachment rules in the
model network. It is most accurate when the exponent is
close to one (Table I), e.g. when $\alpha = 1$ we obtain a result
of 0.988 for external edges and 0.996 for internal edges,
independent of the trial exponent used. Reasonable results
are obtained for $0.5 < \alpha < 1.5$, with a maximum
error of around 0.12, increasing as $\alpha$ differs more from
1. Exponents very different to one are unlikely to give
scale-free networks [8], so it is reasonable to expect $\alpha$ to
lie within this range.
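The quoted maximum error can be checked directly from the values in Table I. The short Python snippet below is only an arithmetic check; the variable names are illustrative.

```python
# Input exponent -> (output for external edges, output for internal edges), from Table I.
table = {
    0.5: (0.4817, 0.4807),
    0.8: (0.7823, 0.8053),
    1.0: (0.9882, 0.9955),
    1.2: (1.2038, 1.2044),
    1.5: (1.6245, 1.5594),
}

errors = {a: tuple(abs(out - a) for out in outs) for a, outs in table.items()}
max_err = max(max(e) for e in errors.values())
for a, (ext, internal) in errors.items():
    print(f"input {a}: external error {ext:.4f}, internal error {internal:.4f}")
print(f"maximum error: {max_err:.4f}")
```

The largest deviation occurs for external edges at an input exponent of 1.5, consistent with the error growing as the exponent moves away from one.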

The exponent is somewhat sensitive to the number
of bins used in the least squares fit, as shown in Fig. 2, 
but the result is within 0.05 of the input exponent for 
reasonable choices for the numbers of bins. For very few 
bins, the least squares fit is inaccurate, whereas for very 
many bins, a significant number have zero value and so 
will not contribute. For both internal and external links, 
the slope is flatter between these extremes, implying that 
the method is robust with respect to the number of bins 
in that range. 

The optimal number of bins for both internal and ex- 
ternal links is consistent with approximately 60 data 
points per bin. The actual number of bins reflects the 
relative frequencies of the two types of link; for the cur- 
rent examples there are 19205 internal links and 1455 
external links. Analysis of 100 networks revealed that 
for external links, 25 bins is the optimal number, giving 
a mean $\alpha$ of 1.00067 and a standard deviation of 0.02280.
Note that the method gives different results on different 
realisations of the network because of the stochastic na- 
ture of the network growth. For internal links, the op- 
timal number of bins was 300, with mean 1.00067 and 
standard deviation 0.00593. 
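The statement that both optimal bin counts correspond to roughly 60 data points per bin follows from the link counts quoted above, as this small Python check shows (variable names are illustrative):

```python
# kind -> (number of links, optimal number of bins), as quoted in the text.
links = {"external": (1455, 25), "internal": (19205, 300)}

points_per_bin = {kind: n / bins for kind, (n, bins) in links.items()}
for kind, value in points_per_bin.items():
    print(f"{kind}: {value:.1f} data points per bin")
```

Both values lie within a few points of 60, so the number of bins scales roughly with the number of links of each type.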

The method would be simpler if ME/SCs could be ignored
when adding internal links. To test this, ME/SCs
were excluded while making the network, but allowed in
the analysis. This gives $\alpha = 0.779$ for internal links,
with an input exponent of one. This should be compared
to $\alpha = 0.996$, obtained when ME/SCs were excluded
in the analysis. The cause of the problems can be
understood from Fig. 3(a), which shows $f(k_i k_j)$ against
$k_i k_j$. $f(k_i k_j)$ is too low for pairs of nodes with high
$k_i k_j$, and too high for pairs with low $k_i k_j$. There are
fewer edges between high-degree nodes than the analysis
expects, because ME/SCs are excluded and high-degree
nodes are likely to be connected already. Therefore, there
are more edges between low-degree nodes. For comparison,
$f(k_i k_j)$ is shown in Fig. 3(b) for internal links with
ME/SCs excluded in both the creation of the network
and its analysis, giving a straight line with exponent close
to 1.

FIG. 2: (Colour online) Variation of the exponent with the
number of bins used in the least squares fit for the model,
for (a) external and (b) internal links. The exponent used
to make the network, $\alpha = 1$, is also plotted. Error bars are
standard error from the linear least squares fit.

We also tested some of the alternative methods for
extracting $\alpha$ that were described in the introduction
[9-14, 22, 23]. We just give results for the current model
networks, when the input exponent is one for both internal
and external links. Using the method that follows
the number of edges obtained over a certain time interval
by nodes with a given degree at the start of that interval
[10-13, 22], we obtain $\alpha \approx 0.7$. The method of Newman
[9, 14, 23], which approximates the time-dependent
constant in $\Pi(k,t)$, gives an exponent of $\alpha \approx 0.94$ for
external edges, but for internal edges, a least-squares fit
gives an exponent of approximately 0.4. The plot is close
to linear at low degree, but has a cut-off at high degree
similar to that of Figure 3, which was due to the effects
of ME/SCs.

That the performance of these alternative methods on
the current networks is significantly worse than that of our
self-consistent approach is unsurprising given the greater
number of approximations involved. Furthermore, it is
also not straightforward to account for effects due to the
exclusion of ME/SCs in either of these alternative methods,
leading to further inaccuracies. However, we should also
note that the current examples represent somewhat atypical
networks. In particular, they are unusually dense
with a higher average degree than many other networks.
Moreover, our aim was not to attempt to exhaustively
compare the different methods of detecting preferential
attachment, but rather only to determine which is most
appropriate to study the IS networks.

FIG. 3: $f(k_i k_j)$ against $k_i k_j$ for internal links when (a)
ME/SCs are excluded in making the network, but allowed
in the analysis, (b) ME/SCs are also excluded in the analysis.
$\alpha = 1$ was used in the growth of the network; this is also
shown. The trial exponent is 1.



IV. CONNECTING UP AN ENERGY LANDSCAPE

In previous analyses of IS networks, each transition
state gives rise to an edge in the network (except when
it leads to an ME/SC) no matter what the energy of the
transition state involved. Here, we want to investigate
the relationship between the IS networks and the topography
of their potential energy landscape with a view to
better understanding the scale-free character of the IS
networks.

FIG. 4: (a) Disconnectivity graph for LJ13. As the maximum
transition state energy, $E_{max}$, is increased, more minima
become interconnected. (b) The increase in the number of nodes
in the IS network with $E_{max}$. The unit of energy is $\epsilon$, the pair
well depth of the Lennard-Jones potential.

One method that has been particularly successful at
visualising the topography of multi-dimensional potential
energy landscapes is the disconnectivity graph [29-32].
The disconnectivity graph for LJ13 is illustrated in
Fig. 4(a). Each line in the graph represents a set of minima
that are connected by paths that lie below a given
energy threshold, $E_{max}$. Each line begins at the energy of
the corresponding minimum. As $E_{max}$ increases, transition
states become accessible that link up sets of minima
that were previously disconnected, and the corresponding
lines in the disconnectivity graph join up. Finally,
at large $E_{max}$ the network has a giant component that
spans the whole potential energy landscape and there is
a single line in the disconnectivity graph. When viewing
Fig. 4(a), one should note that the disconnectivity analysis
has been performed at a series of discrete energy
levels. This is purely for convenience in the generation
and visualization of these graphs.
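The merging of sets of minima as the threshold is raised can be sketched with a union-find structure, as in the Python example below. This is a simplified illustration, not a disconnectivity-graph code: it ignores the energies of the minima themselves (every minimum is counted as its own set from the start), and the data format `(energy, minimum_a, minimum_b)` and function name are assumptions.

```python
def components_vs_threshold(n_minima, transition_states, thresholds):
    """Number of connected sets of minima at each threshold energy.
    transition_states: iterable of (energy, minimum_a, minimum_b)."""
    parent = list(range(n_minima))

    def find(x):
        # follow parent pointers to the set representative, halving paths
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    ts = sorted(transition_states)
    counts, i, n_sets = [], 0, n_minima
    for e_max in sorted(thresholds):
        # admit every transition state lying below the current threshold
        while i < len(ts) and ts[i][0] < e_max:
            ra, rb = find(ts[i][1]), find(ts[i][2])
            if ra != rb:
                parent[ra] = rb
                n_sets -= 1
            i += 1
        counts.append((e_max, n_sets))
    return counts

ts = [(-1.0, 0, 1), (0.0, 1, 2), (0.5, 0, 2), (1.0, 2, 3)]
print(components_vs_threshold(4, ts, [-0.5, 0.75, 2.0]))
```

Evaluating the count only at a discrete list of thresholds mirrors the discrete energy levels used when drawing such graphs.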

Disconnectivity graphs are useful because they provide 
a visual representation of the barriers between different 
regions of the energy landscape. In the case of LJ13, the 
landscape has a single funnel topography, and this is clear 
from Fig. 4, which has a single dominant stem, i.e. most 
minima join directly to the set of minima containing the 
global minimum, and the barriers involved are relatively 
small. 

Here, in a similar way, we wish to examine the topology
of the IS network as $E_{max}$ increases, where only those
transition states with energy less than $E_{max}$ can contribute
edges to the network. Thus, we can look at the
dynamics of growth of the network, with $E_{max}$ playing
a similar role to time. As the network grows one would
expect new nodes and edges to be more likely to connect
to lower-energy minima that have larger basins of attraction
and larger degree. Indeed, for the complete network
a strong correlation between degree and minimum energy
has been noted [15, 16]. Therefore, we decided to
investigate whether this growth of the network might be
describable in terms of a preferential attachment model.

FIG. 5: (Colour online) Average degree, $\langle k \rangle$, and degree of
the global minimum, $k_{GM}$, for the IS network as a function
of the number of nodes in the network.

$E_{max}$ is increased such that only one transition state is
added at each step, and minima (nodes) are only included 
in the network once they become connected to another 
minimum by a transition state. Edges in the network cor- 
respond to transition states, but not all transition states 
lead to edges, as some connect permutational isomers of 
the same minimum, forming self-connections, and more 
than one transition state may connect the same pair of 
nodes, forming a multiple edge. Transition states that 
would form ME/SCs are ignored. 
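The quasi-growth procedure just described can be sketched in Python as follows. The input format `(energy, minimum_a, minimum_b)`, the function name and the event labels are assumptions for illustration; the logic simply adds transition states in order of increasing energy, skips those that would form ME/SCs, and records whether each accepted edge is internal, external or joins two new nodes.

```python
def quasi_growth(transition_states):
    """Add transition states in order of increasing energy.
    Returns the list of accepted events and the final degree of each minimum."""
    degree = {}      # minima already in the network -> current degree
    seen = set()     # pairs of minima that are already linked
    events = []
    for energy, a, b in sorted(transition_states):
        pair = frozenset((a, b))
        if a == b or pair in seen:
            continue                      # self-connection or multiple edge
        seen.add(pair)
        n_new = (a not in degree) + (b not in degree)
        kind = ("internal", "external", "new pair")[n_new]
        events.append((energy, kind, a, b))
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    return events, degree

ts = [(-40.0, 0, 1), (-39.0, 1, 2), (-38.5, 0, 0),
      (-38.0, 0, 2), (-37.0, 0, 1), (-36.0, 3, 4)]
events, degree = quasi_growth(ts)
print([kind for _, kind, _, _ in events], degree)
```

The sequence of event types produced this way gives the relative frequencies of the three processes, and the degrees recorded just before each event supply the quantities entering Eq. 3.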

Fig. 4(b) illustrates how the number of nodes in the
network increases with $E_{max}$. The distribution of the
number of minima as a function of energy is expected to
be a Gaussian for bulk systems [33-35], and even for a
small cluster such as LJ13, the distribution is a reasonable
approximation to a Gaussian [16]. Therefore, the rate of
growth of the number of nodes in the IS network goes
through a maximum at an $E_{max}$ value near to the centre
of this distribution (it will actually be slightly above,
because a minimum is only added to the network once it
becomes connected to another minimum by a transition
state with energy less than $E_{max}$).

The number of edges in the network increases faster
than the number of nodes, as $M \sim N^{1.56}$. The distribution
of transition states as a function of energy is also
expected to be Gaussian, but where the centre of the
Gaussian is displaced to higher energy compared to minima
[36, 37]. Therefore, the number of edges increases
more rapidly than the number of minima at larger $E_{max}$.
This result implies that the average degree increases as
$\langle k \rangle \sim N^{0.56}$, as shown in Fig. 5, i.e. the network is
accelerating.

FIG. 6: (Colour online) Cumulative degree distribution for
the IS network at different stages of the quasi-growth process.
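The step from the scaling of the number of edges to the scaling of the average degree uses only the identity that each edge contributes to the degree of two nodes. Written out, with $A$ an unspecified prefactor introduced here purely for illustration:

```latex
\langle k \rangle = \frac{2M}{N}, \qquad M \simeq A\, N^{1.56}
\;\Longrightarrow\;
\langle k \rangle \simeq 2A\, N^{1.56 - 1} = 2A\, N^{0.56}.
```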

From Fig. 6, we can see how the degree distribution 
develops during the quasi-growth process. The overall 
form of the distribution is quite similar throughout the 
growth process, although the scale-free tail becomes more
pronounced as the network grows and fluctuations are 
reduced. 

These features affect other network properties. For example,
$M$ increases more slowly than $N^2$, so the network
becomes more sparse. This causes the clustering coefficient
to decrease, although the network becomes more
highly clustered than a random network with the same
degree distribution, i.e. the clustering becomes more
significant. Secondly, the accelerating nature of the network
affects the average shortest path length of the network,
$l_{ave}$. If the random graph result for $l_{ave}$ were to apply,
i.e. $l_{ave} \sim \log N / \log \langle k \rangle$, then $l_{ave}$ for an accelerating
graph should remain constant during growth [38]. In fact,
after an initial increase the average shortest path length
actually decreases with further growth.
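The reason the random graph estimate predicts a constant path length for an accelerating network can be made explicit. Assuming a power-law growth of the average degree, $\langle k \rangle \simeq B\, N^{0.56}$ with an unspecified prefactor $B$ (a symbol introduced here only for illustration), the logarithms of $N$ cancel at large $N$:

```latex
l_{ave} \sim \frac{\log N}{\log \langle k \rangle}
        \simeq \frac{\log N}{0.56 \log N + \log B}
        \;\xrightarrow{\;N \to \infty\;}\; \frac{1}{0.56} \approx 1.8 .
```

The limiting value is only as good as the random graph estimate itself, which the text notes is not followed by the IS network.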

Indicators of preferential attachment can be found in
the evolution of some of the network properties. For example,
Fig. 5 shows the increase in degree of the global
minimum, the 'oldest' node, and the one which has the
highest degree in the final network, as the network grows;
it has the form $k_{GM} \sim N^{0.98}$. The exponent describing
how the degree of a node increases with network size
(number of nodes) can be measured for each node, and is
distributed about one, implying that this behavior of the
global minimum is fairly typical. The degree of a particular
node is expected to grow faster than the average
degree, because the addition of new nodes with $k = 1$
reduces the average degree. However, $k_{GM}$ increases much
faster than expected from this effect, indicating that
preferential attachment is significant during growth.

Applying the self-consistent method of Section II, the
exponents for preferential attachment were determined to
be $\alpha = 0.87 \pm 0.05$ for external links, and $\alpha = 0.63 \pm 0.05$
for internal links. These exponents depend on the number
of bins used in the least squares fit, as did those for
the model network. However, the variation is very similar
to that for the model networks in Section III, again
allowing an appropriate number of bins to be chosen.

FIG. 7: (Colour online) Degree distributions for the IS network
and two model networks using $f(k)$ taken from the
IS network results. The first model uses $f(k) = k^{0.87}$ and
$f(k_i k_j) = (k_i k_j)^{0.63}$, the second uses $f(k) = 67.53 + k^{1.56}$
and $f(k_i k_j) = (k_i k_j)^{0.63}$ for external and internal links
respectively. For clarity, results are shown for one realisation of
the model networks. When averaged over 1000 networks the
results are qualitatively the same.

Fig. 7 shows the degree distribution obtained for the 
model using the exponents found for the IS network. 
Note that the degree distribution for the IS network is 
only a power-law over a finite range. The degree dis- 
tribution for the model is very similar to that for the 
IS network for mid-degree nodes, 30 < k < 100, in 
the range where the IS network is scale-free. Perhaps 
unsurprisingly, the model fails in reproducing the very 
high-degree nodes with k > 200, and the behavior at low 
degree, where the IS network deviates from a scale-free 
distribution. 

In our analysis, correlations between nodes are not con- 
sidered. However, the IS network is assortative with re- 
spect to energy [16], i.e. nodes with similar energies are 
more likely to be connected. When a new node appears 
and creates a new external link, the nodes that have the 
most similar energies are the newest nodes, with lowest 
degrees. Therefore, f(k) for low-degree nodes is higher 
than expected for a preferential attachment rule, as can 
be seen in Fig. 8. The method used so far relies on $f(k)$
following a power-law, but can be adapted to use different
functions for $f(k)$. The only difficulty arises in extracting
coefficients from the data for $f(k)$. As a rough
estimate, a power law does describe the slope fairly well.
In this case, a better fit is provided by $f(k) = k_0 + k^\alpha$.

Using this form for $f(k)$ in our self-consistent approach
gives $f(k) = 67.33 + k^{1.56}$. Compared to the original
power-law fit, the exponent is now much greater, showing
strong preferential attachment and increasing the number
of external edges gained by high-degree nodes. Large
$k_0$ means that low-degree nodes gain more external edges.
This leads to the degree distribution shown in Fig. 7. It
is clear that improving the form of $f(k)$ has only a minor
effect on the degree distribution. The power-law form,
which is easier to implement, is a good approximation,
but care should be taken in attaching significance to the
value of the exponent.

FIG. 8: (Colour online) $f(k)$ against $k$ for the IS network
after convergence of our self-consistent approach to $\alpha = 0.87$,
assuming $f(k) = k^\alpha$. The best fits for $f(k) = k^\alpha$ and
$f(k) \propto k_0 + k^\alpha$ are also shown.
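To see how the two fitted forms differ, one can compare the attachment weight of a node of degree $k$ relative to a node of degree 1. The Python sketch below uses the constants quoted in the text for external links; the function names and the choice of degrees 100 and 1000 are illustrative assumptions.

```python
def f_power(k, alpha=0.87):
    """Pure power-law form, f(k) = k**alpha."""
    return k ** alpha

def f_shifted(k, k0=67.33, alpha=1.56):
    """Two-parameter form, f(k) = k0 + k**alpha."""
    return k0 + k ** alpha

def relative_weight(f, k):
    """Attachment weight of a degree-k node relative to a degree-1 node."""
    return f(k) / f(1)

for k in (100, 1000):
    print(k, round(relative_weight(f_power, k), 1),
          round(relative_weight(f_shifted, k), 1))
```

Relative to a degree-1 node, the two-parameter form favours a degree-100 node less than the pure power law does, but favours a degree-1000 node more, which matches the description of a large $k_0$ helping low-degree nodes while the larger exponent helps the highest-degree nodes.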



V. CONCLUSION 

We have introduced a self-consistent method to mea- 
sure preferential attachment in any network where time- 
resolved data is available. Tests of the method on model 
networks where the form of preferential attachment is 
known show that the method can determine this prefer- 
ential attachment accurately, although any such method 
will involve some uncertainty as network growth is a 
stochastic process. The method is not limited to power- 
law preferential attachment rules. However, this can pro- 
vide a simple approximation to more complex forms, giv- 
ing similar final degree distributions. Other topological 
properties, such as assortativity, may be affected by this 
approximation. We have also shown that multiple edges 
and self-connections can be very important, particularly 
if the networks studied are dense. Excluding ME/SCs 
leads to a cut-off at high degree in the degree distribution 
of the resulting network, and failure to take the absence 
of ME/SCs into account when measuring preferential at- 
tachment underestimates its strength. 

We have then applied this method to an IS network, 
which has a scale-free degree distribution. Although 
such networks are static, a helpful way to think about 



the properties of the landscape is in terms of how the 
landscape becomes connected up as a threshold energy 
$E_{max}$ is raised. We were able to show that during this
quasi-growth process, there is preferential attachment, 
i.e. that as new minima are added to the network they are 
more likely to add to the lower-energy high-degree nodes. 
Therefore, the topography of the energy landscape plays 
a key role in the development of the scale-free topology 
of the IS network. 

This interplay of topology and topography can help 
us understand why features like funnels [39, 40] are so 
generic to energy landscapes. At the bottom of such 
a funnel is a deep energy minimum that has no fur- 
ther downhill connections. The preferential attachment 
implies that higher-energy minima are likely to be con- 
nected to lower-energy minima, which, if not themselves 
funnel bottoms, are in turn likely to be connected
to even lower-energy minima, and so on. This naturally 
generates features on the energy landscape, where there 
is a convergence of pathways down to a particular low- 
energy structure, and where there is an overall gradi- 
ent of the landscape down towards this structure. These 
are the archetypal characteristics of a funnel on an en- 
ergy landscape, a landscape object that has featured so 
prominently in the understanding of how relaxation to 
a target low-energy configuration is possible on a high- 
dimensional energy landscape, such as in protein folding 
[41]. 

On a relatively low-dimensional landscape, such as for
LJ13, the landscape is dominated by a single funnel
surrounding the global minimum. However, this does not have
to be the case for higher-dimensional landscapes: when
there are deep minima that are well-separated in
configuration space, each can give rise to its own funnel,
thus resulting in a multiple-funnel landscape [31, 42].
On a purely topological level, and in network parlance,
these funnels are evident in terms of different communities
surrounding high-degree nodes [17].

A model network grown using the preferential attach- 
ment rules discovered for the IS network reproduced the 
scale-free part of the degree distribution of the IS net- 
work fairly well. However, there were discrepancies at 
high and low degree, indicating that preferential attach- 
ment alone is not sufficient to describe the whole degree 
distribution. Of course, we could get an increasingly ac- 
curate description of the development of the network by 
using increasingly complex forms for f(k). However, as 
the forms become more complex, they are also likely to 
produce less physical insight. The use of a two-parameter 
expression for f(k) did, though, confirm the robustness 
of the preferential attachment result, and illustrated that 
the exact value of the exponent should be interpreted 
with caution. 

It is also possible to consider how the IS network 
changes as the form of the potential describing the in- 
teratomic interactions in the cluster is changed. In par- 
ticular, as the range of the potential decreases, new min- 
ima and transition states appear on the potential energy 
landscape [43-45]. This will be studied in future work.

ciety (JPKD) for financial support.